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ABSTRACT 

Mass segregation in a star cluster is studied in an analytical manner. We consider 
a two-component cluster, which consists of two types of stars with different masses. 
Plummer's model is used for the initial condition. We trace the overall behaviors of the 
probability distribution functions of the two components and obtain the timescale of 
mass segregation as a simple function of the cluster parameters. The result is used to 
discuss the origin of a black hole with mass of > 10 3 M found in the starburst galaxy 
M82. 

Subject headings: stellar dynamics — galaxies: star clusters — galaxies: starburst 



INTRODUCTION 



Within a star cluster, massive objects segregate into the cluster core as they lose kinetic energies 
to the less massive objects during approach toward energy equipartition. The result is an increase 
of the number density of the massive objects in the core. This phenomenon of mass segregation has 
long been known (Spitzer 1969; Binney & Tremaine 1987, p. 531) but has attracted new interests 
in recent years. The large number density induces successive mergers of those massive objects, 
which could lead to the formation of an exotic object such as a very massive black hole. 

The important parameter of mass segregation is its timescale. In order to explain an observa- 
tion in terms of mass segregation, the timescale has to be less than the age of the star cluster. 

However, mass segregation has been studied only for specific cases using numerical methods, 
i.e., Fokker-Planck, Monte Carlo, and A-body simulations. The exceptional analytical works that 
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are available are those of Spitzer (1969) and Tremaine, Ostriker, & Spitzer (1975). Spitzer (1969) 
considered an isothermal cluster of uniform density and estimated the timescale of mass segregation 
as the timescale of energy equipartition between stars of different masses. This model is too idealized 
for a star cluster, which is not isothermal or of uniform density. Tremaine et al. (1975) studied a 
decaying circular motion of an object in the singular isothermal sphere. This is originally a model 
for a motion of a globular cluster around the center of a galaxy, and it is not a model for random 
motions of stars in a cluster. Therefore, an analytical study of a more appropriate situation is 
desirable. 

We analytically study a two-component cluster, which consists of two masses mo and m (mo < 
m). The major component is the less massive stars. Their total mass Mq dominates over the 
total mass M of the more massive stars, Mq 3> M. Here the subscript is used to indicate that 
the quantity is relevant to the major component. The cluster is spherically symmetric in all its 
properties and completely isotropic in the velocity space. 

Since the timescale of mass segregation is expected to be less than the relaxation timescale 
(Spitzer 1969), the star cluster is regarded as almost collisionless. The individual stars move almost 
freely in the mean potential. The gravitational encounters occur only in a stochastic manner. Our 
study is accordingly based on the overall behaviors of the probability distribution functions (PDFs) 
of the major and minor components. They are approximated by PDFs of completely collisionless 
systems. The minor-component PDF is from a parameterized family of steady-state PDFs for which 
the degree of mass segregation is allowed to change. The major-component PDF is kept the same 
(§2). These PDFs are used to obtain the energy loss per unit time from the minor component due 
to encounters with the major component (§3). The energy loss rate yields the rate of change of the 
minor-component PDF, which in turn yields the timescale of mass segregation (§4, eq. [26]). 

We compare our result with those for relevant timescales, including the pioneering work of 
Spitzer (1969). We also use our result to discuss the origin of a black hole with mass of > 10 3 M & 
found in a young star cluster of the starburst galaxy M82 (§5). 



2. PROBABILITY DISTRIBUTION FUNCTIONS 

The major component, i.e., the less massive stars, is assumed to follow Plummer's model (poly- 
trope with n = 5; Binney & Tremaine 1987, p. 223). This steady-state solution for a collisionless 
spherical system has a simple analytic form but well reproduces observations of some globular clus- 
ters. The PDF in the phase space is determined by two parameters, e.g., the total mass Mq and 
the core radius r c : 



32^/2 3M (GM Q \ ' / , 



2\ 7/2 



with /o = at v > (2^0 )^ 2 - Here G is the gravitational constant and v is the velocity. The relative 
potential ^o is a function of the radius r and defined as the negative of the gravitational potential 
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$ < 0: 

*o(r) = -*o(r) = ^ I /2 . (2) 
r c (1 + r z /r z ) ' 



The corresponding mass density po is 

r0 ° 3M 1 

( 1 + r 2/ r 2) 

Here the integral Air J °° r 2 podr gives the total mass Mq. The squared velocity dispersion a 2 is 



Oo (r) = 47r / v 2 fodv = - — ? —rr . (3) 

F y ' Jo 4vrr 3 n + r 2/ r 2&* 



4*f~^f dv GM 1 
01 ) ^f °°v 2 f dv 2r c (1+^2)1/2" ^ ^ 

The minor component, i.e., the more massive stars, is assumed to have no contribution to 
the gravitational field of the star cluster. Then the gravitational field is determined solely by the 
relative potential of the major component. According to the Jeans theorem for a steady-state 
collisionless spherical system (Binney & Tremaine 1987, p. 221), any function of *o — v 2 /2 serves 
as the minor-component PDF. The PDF assumed in our study is from a function family that has 
a parameter (3 > 1: 



32y^ ^ 3M /GM \- (7/3+3)/2 / v 2 \ W ' 2 

— Ft Fo =■ I I I Wn — 



with fp = at v > (2\I/o) 1/2 - The factors F\ and F2 are functions of /3 defined as 

x 2 dx 1 1 



f 

Jo 
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and 
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j, j\ion- ..... _ t*_ 1 

/o 512F 2 (f))' 
where iq = F2 = 1 at j3 = 1. Their numerical values are shown in Table 1 and Figure 1. The 
corresponding mass density pp is 



3M 1 



Fl 47rr3 (1 + r 2 /r2) (7/3+3)/4' 
with the total mass M. The squared velocity dispersion <x| is 



(8) 



2 , . 12 GM 1 o, , 

^ (r) = 7^T5 1^(1 + ^,.2)1/2 ^ ff o(r). 0) 

Figure 2 shows the radial profiles of the mass density and squared velocity dispersion for (3 = 1, 
10, and 100. The total energy Ep is obtained by integrating pp[p 2 p + $0) = P/3(°/3 ~~ *o) over the 
entire volume: 

_ 3vr 2(7/3 - 1) GM M 

^ " "32 ^T^ FlFs ^^- (10) 
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The factor F3 is a function of (3 denned as 

f°° x 2 dx 7T „ , , . 

where F$ = 1 at (3 = 1. We define the initial state as //3=i so that the radial profile of mass density 
and the velocity dispersion are the same as those of the major component. With an increase of 
the (3 value, the mass density of the minor component becomes more centrally concentrated than 
that of the major component. Simultaneously, the velocity dispersion of the minor component 
becomes smaller (Fig. 2). These behaviors represent mass segregation. The radial profile of 
velocity dispersion of the minor component remains the same as that of the major component. 
This property is essential to establishing the final state of mass segregation where no net energy 
exchange exists between the two components (§3). Thus, with a single parameter (3 > 1, the 
function family /« provides the simplest model to trace the evolution of the minor component from 
the initial to final states of mass segregation. At an intermediate state, the minor-component PDF 
is set to be just one of the functions, / = or a linear combination of them, / = YKMp/M)fp 
with M = J2 M P (§ 4 )- 



3. ENERGY CHANGE RATE 

Consider a single object in the minor component, which loses kinetic energy due to gravitational 
encounters with objects in the major component. If the velocity of the single object is v, the rate 
of change of kinetic energy per unit mass is obtained as 

Ft (t)) = 167r2G2lnA (™o jH v fodv -m £^f dvo\ , (12) 

with /o = /o^o — v o/ty (Binney & Tremaine 1987, eq. [8-66]). Here (•) denotes an ensemble 
average. The cutoff factor A is the ratio of the maximum and minimum impact parameters, 6 max 
and & mm : 

Omin G(m + m ) m + m 
Here v re \ is a typical relative velocity between objects in the cluster (Binney & Tremaine 1987, p. 
511), and A^o = Mq/itiq is the total number of objects in the major component. We have assumed 
w r 2 cl ~ GM /b max = Gm N /b max . 

By multiplying equation (12) by the minor-component PDF / and integrating over the v space, 
we obtain the rate of change of kinetic energy per unit volume of the minor component. For the 
PDF / = - v 2 /2), the result is 

d fpa 2 \\ 2 19 V / 2 16H 1 



8t\ 2 )l 497T 19!! (1 + r 2/ r 2)(7/3+12)/4 
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The factor F4 is a function of (3 denned as 

£*(1-^^*=™FM, as) 

where F4 = 1 at (3 = 1. By integrating equation (14) over the entire volume, we obtain the rate of 
change of total energy E of the minor component: 

^ = - 221 ^ l!!!!EM flW5r » 

eft 3 x 5 x 7 2 19!!T(15/4) 1 2 4 5 c 

xG 2 lnA f-=L_ _ HH\ ^™ f^Y 1/2 . (16) 

The factor F§ is a function of (3 defined as 

f 00 * 2 dx y^ r(13/4) 

7o (l + x2p +12 >/ 4 15r(15/4) WJ ' li ° 

where F 5 = 1 at {3 = 1. 

Recall that gravitational encounters in a star cluster are stochastic. The energy change rates 
(12) and (14) have been obtained as ensemble averages. They are not locally defined quantities. The 
energy change rate for the entire minor component (16) has been obtained by assuming (dE/dt) = 
dE/dt because the cluster contains many stars. 

The first and second integrals in equation (12) represent heating and cooling by the major 
component, respectively. They yield the terms mo/9 and m/{7(5 + 2), respectively, in equations 
(14) and (16). When the two terms balance, the total energy of the minor component settles to an 
equilibrium, dE/dt = 0. Hence the (3 value corresponding to the final state of mass segregation is 

^ ~ 7m ■ (18) 

This relation does not imply equipartition of kinetic energies, ma 2 = mo<Tg (see eqs. [4] and [9]), 
because the major component is kept to follow Plummer's nonthermal model. The condition for 
energy equipartition in an equilibrium state is that each of the two components has a thermal 
velocity distribution (Chapman & Cowling 1970, p. 80). A qualitatively consistent result for very 
large values of m/rriQ was obtained by Chatterjee, Hernquist, & Loeb (2002). 1 



1 Chatterjee et al. obtained the squared velocity dispersion a 2 at r — » for a Brownian motion of a single massive 
object that is in equilibrium with the background stars. Their study is mathematically equivalent to ours because 
we have ignored encounters among objects in the minor component. The background stars were assumed to follow 
Plummer's model as in our present study. 
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4. TIMESCALE OF MASS SEGREGATION 

By using the energy change rate dE/dt in equation (16), we estimate the timescale of mass 
segregation r ms . This timescale is defined in terms of the secular increase of the mass of the minor 
component, i.e., the more massive stars, in the cluster core M c : 



dM c /dt 

with 



(19) 



t=o 



4vr I C r 2 pdr = — !-p-M, (20) 
Jo 2V2 

for f = fp. The factor Fq is a function of (3 defined as 

x 2 dx 1 , s , s 

Fe(J3), (21) 



where F 6 = 1 at {3 = 1. 

Since it is not analytically determined how the minor-component PDF changes in the course 
of mass segregation, we study two extreme cases. The one is that the minor-component PDF 
is characterized by a single (3 value, which increases from the initial value of 1 toward the final 
value /? fin , i.e., / = fp with /?=latt = 0(l</3< /3 fin ). Equations (6), (20), and (21) 
yield dM c /df3, while equations (6), (10), and (11) yield dE/df3. Then dM c /dt is obtained as 
(dM c /d(3){df3/dt) = (dM c /d(3)(dE/dt)/(dE/d(3). Using their initial values, we obtain 



3 3 x 5 x 7 2 7r^Fl9!!r(15/4) 7-81n2 

n2- 2y/2 

(22) 



2 23 ^2 16!! r(13/4) 8 ln(l + y/2) - 3V2ln 2 - 2^2 
1 1 (3M \~ VGM n ^ 3/2 



c 



G 2 In A m — mo \ J \ 2r, 

The numerical factor is about 0.3878. 

The other extreme case is that the state (3 = 1 evolves directly into the state (3 = (3^ n . 
These two states serve as the bases for the minor-component PDF, i.e., / = (Mp = i/M)fp = i + 
(Mp = p R JM)fp = p fin = (1 - 7 )fp =l + 7/ / 3= /3fln with 7 = at t = (0 < 7 < 1). The core mass M c 
and the total energy E of the minor component are 

M c = (l- 7 )^=M + 1 M, (23) 

^ T-i n ,3vr GM M GM M 

^ = - (1 " 7) 32^ 7 ^"— (24) 

Here we have made approximations for the state (3 = /3fi n taking account of the fact that the minor 
component is well concentrated in the core and has a negligible velocity dispersion. Then dM c /dt is 
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obtained as (dM c /dj)(d'y/dt) = (dM c /d / y)(dE/dt)/(dE /dj). Using their initial values, we obtain 

3 2 x 5 x 7 2 ^ 19!! T(15/4) 32 - 3^ 



2 23 16!! r(13/4) 2^2-1 
1 1 /3M \ _1 (GM \ 3/2 

G 2 In A m — rriQ \ 4nr^ J \ 2r c 



(25) 



The numerical factor is about 0.6331. The present case has a somewhat larger timescale than the 
previous case because the effective energy difference is somewhat larger, i.e., dE/d^f < dE/d(3 < 
while dMc/d'y ~ dM c /d(3 in the initial state. 

Mass segregation in a real star cluster is expected to be intermediate between the above two 
cases. According to Fokker-Planck and Monte Carlo simulations of early stages of mass segregation 
(Inagaki & Wiyanto 1984; Fregeau et al. 2002), the ratio of velocity dispersions cr/ao averaged over 
the whole cluster does not change appreciably, while that averaged over the core alone approaches 
steadily to the final value. The reason is that gravitational encounters occur preferentially among 
stars on orbits that pass through the dense core. Although this fact has been ignored in our 
present calculation, the behavior of <j/<Jq would be reproduced if the minor-component PDF were 
a superposition of various states between those of (3 = 1 and (5 = (5^ n with the average (5 value 
increasing with the time. 

Equations (22) and (25) have the same dependence on physical quantities. The values of the 
numerical factors are also similar to each other and well approximated to be 0.5 ±0.1 or 2 _1 . Thus 
the timescale of mass segregation is 

1 1 J (SMpy 1 (GM \ 3 / 2 

Tms 2G 2 lnAm-m \4mi) \ 2r c J ' 1 j 

where 3Mo/4irr 3 and (GMo/2r c ) 1 ^ 2 correspond to the central mass density po(0) and central ve- 
locity dispersion o"o(0) of the major component, respectively (eqs. [3] and [4]). 



5. DISCUSSION 

5.1. Comparison with Relevant Timescales 

The timescale of mass segregation is greater than the core crossing timescale. Since the core 
radius is r c and the central velocity dispersion is (GMo/2r c ) 1 / 2 , the core crossing timescale r cc is 

2r 3 xi/2 

c ^ . (27) 



GM 

Then equation (26) for the timescale of mass segregation r ms is rewritten as 

7r moiVo/(m - mo 



6 In [m iVo/ (m + m )] 



r cc > r cc , (28) 
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where we have assumed mo/m ~ 10 _1 -10~ 2 and — 10 5 -10 6 as a practical situation. On the 
other hand, the timescale of mass segregation is less than the relaxation timescale t t \ x of the major 
component, which is about Nq/8\iiNq times the crossing timescale r cc (Binney & Tremaine 1987, 
p. 190): 

Trlx ~ 8 ml|; Tcc > Tms ' (29) 
These results justifies our assumption that the star cluster is almost collisionless and the major- 
component PDF remains the same during mass segregation. 

The timescale of mass segregation is close to the timescale of energy loss from the minor 
component, T e i = E/(dE/dt) at t = 0. Using equations (10) and (16), we obtain 

3 3 x 5 x 7 2 7ry5F l9!! r(15/4) 1_ _L /3Mo\ _1 /GMp_\ 3/2 

Tel 2 23 16!!r(13/4) G 2 lnAm-m J \ 2r c J ' { } 

The dependence on physical quantities is the same as that of equation (26). This is because the 
energy loss from the minor component and the increase of its central concentration are of the 
same phenomenon. The numerical factor of equation (30) is about 0.4832, which happens to be 
in agreement with the numerical factor 2 _1 of equation (26). Since this energy loss timescale is 
obtained without any assumption on the evolution of the minor-component PDF, it is a robust 
measure. 



5.2. Comparison with Spitzer (1969) 

Spitzer (1969) considered an isothermal cluster of uniform density and estimated the timescale 
of mass segregation as the timescale of approach toward energy equipartition between the major 
and minor components, 2 which was about mo/m times the relaxation timescale r r i x of the major 
component. Using equation (29), we obtain the corresponding timescale in our model cluster: 

^ 1 m N /m 

T ee (Spitzer) — g i n jy Q Tcc- \ d ' 

If m > mo, this energy equipartition timescale is by definition equivalent to our energy loss 
timescale r e \ in equation (30). Since the numerical factor is 2 _1 , the equation is rewritten as 

7T moN /m 

Tel ~ 6 In (m AV m) cc ' 1 ' 

where we have replaced m ± m with m. Except for an unimportant difference in the argument of 
the logarithm, the timescales (31) and (32) have the same dependence on physical quantities. On 
the other hand, the numerical factors are different. This difference is not serious because there is 



2 Defined as the time it takes for the difference between ma 2 and mocrg to decrease by a factor of e 1 . 
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ambiguity in the definition of the relaxation timescale. Our relaxation timescale (29) is based on 
the core crossing timescale r cc and might not be sufficiently large to yield the energy equipartition 
timescale for the whole cluster. Overall, although Spitzer (1969) considered a very idealized cluster, 
his timescale is expected to be valid for energy equipartition in a real star cluster at least as an 
order-of-magnitude estimation. This is also the case for mass segregation, which occurs during 
approach toward energy equipartition. 

Spitzer (1969) also predicted that, if the total mass of the minor component is much less 
than that of the major component, M <C Mq, the two components eventually achieve energy 
equipartition, ma 2 = tjiqCq. His prediction has been confirmed at least for the cluster core by 
Fokker-Planck and Monte Carlo simulations (Inagaki Sz Wiyanto 1984; Watters, Joshi, & Rasio 
2000: Fregeau et al. 2002). Since it is required that both the major and minor components evolve 
to have thermal velocity distributions (§3), the energy equipartition is achieved at t > r r i x . 3 This 
is beyond the limit of our model (t < r r i x ), where the cluster is assumed to be almost collisionless. 
The major component is kept to follow Plummer's nonthermal model and hence does not achieve 
energy equipartition with the minor component, regardless of the total mass ratio M/Mq. This 
fact is not serious to our estimation of the timescale of mass segregation, which is less than the 
relaxation timescale r r i x and has been defined at the initial time of the evolution t = in equation 
(19). 



5.3. Application to Observations 

If the central mass density po(0) and central velocity dispersion o"o(0) have values typical of 
globular clusters (Binney & Tremaine 1987, p. 26, see also p. 423 for the cutoff factor A), equation 
(26) is written as 

7 /lnAV 1 / m VV MO) VV <tq(0) V 

Tms = 3xl0yr V^J {towJ U 4 M 0P c-3 j [m^) ■ (33) 

Here we have assumed m 3> ttiq and replaced m — mo with m. If we instead use the typical values 
of the core radius r c and central velocity dispersion <r (0), both of which are observable quantities, 
equation (26) is rewritten as 

Thus mass segregation in a typical star cluster has the timescale T ms of order 10 7 yr. 



3 This energy equipartition in the cluster core is not exact. The difference between ma 2 and maa% is very small 
but still existent (> f%; Inagaki & Wiyanto 1984; Watters et al. 2000). The reason is that the velocity distributions 
do not become exactly thermal even in the cluster core. 
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The above timescale of mass segregation is used to discuss the origin of a very massive black 
hole found in the starburst galaxy M82 (Kaaret et al. 2001; Matsumoto et al. 2001; see also 
Matsushita et al. 2000). At the 2 /im secondary peak, i.e., an active site of star formation, there 
is a source of compact X-ray emission. The observed strong variability implies that the source is 
an accreting black hole. The observed luminosity of 10 41 ergss" 1 implies that the mass is greater 
than 10 3 M Q if the emission is isotropic and its luminosity is below the Eddington limit. This black 
hole should have been formed via successive mergers of massive stars and black holes that had 
segregated into the core of a star cluster (Taniguchi et al. 2000). Since the 2 secondary peak 
has a starburst age of about 1 x 10 7 yr (Satyapal et al. 1997), the mass segregation should have 
occurred well within this short duration. The timescales given in equations (33) and (34) are too 
long. We suspect that the star cluster is exceptionally dense and compact. For example, the young 
star cluster R136 in the Large Magellanic Cloud has po(0) — 10 6 M Q pc~ 3 , <to(0) ~ 10 1 kms _1 , 
and r c ~ 10 -2 — 10 _1 pc (Portegies Zwart et al. 1999), which yields r ms ~ 10 5 yr. In fact, only one 
black hole heavier than 10 3 Mq has been found among more than 100 young star clusters of M82. 
The objects that had segregated into the cluster core should have merged in a runaway manner 
(Quinlan h Shapiro 1989; Portegies Zwart et al. 1999; Mouri & Taniguchi 2002) because the merger 
probability is higher for more massive objects. 

This work has been supported in part by the Japanese Ministry of Education, Science, and 
Culture under grants 10044052 and 10304013. We are grateful to N. Kanaeda and the referee for 
helpful comments. 
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FIGURE CAPTIONS 

Fig. 1. — Solid lines: Numerical values of F\, F 2 , F3, F4, F$, and Fq. Dotted lines: Numerical 
values of F\F 2 , FiF 3 , FiF 2 F 4 , F1F2F4F5, and FiFq. The abscissa is the parameter (5. 

Fig. 2. — Solid lines: Radial profiles of the mass density pp in equation (8) for (3 = 1, 10, and 
100. Dotted lines: Radial profiles of the squared velocity dispersion in equation (9). The 
abscissa is the radius r normalized by the core radius r c . For convenience, we have assumed 
3M/47rrj? = GM /2r c = 1. 



This preprint was prepared with the AAS IATJ5X macros v5.0. 
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Table 1. NUMERICAL VALUES OF THE FACTORS F±-Fq 



(3 


Fi 


F 2 


^3 


F A 


^5 


^6 


1 


1.000 


1.000 


1.000 


1.000 


1.000 


1.000 


2 


3.875 


2.294 


0.3469 


0.6177 


0.5424 


0.5480 


3 


7.761 


3.902 


0.1881 


0.4292 


0.3519 


0.3325 


5 


17.81 


7.868 


0.08728 


0.2508 


0.1914 


0.1573 


10 


52.71 


21.15 


0.03083 


0.1081 


0.07705 


0.05366 


20 


152.4 


58.30 


0.01090 


0.04263 


0.02921 


0.01856 


30 


282.1 


106.2 


0.005933 


0.02412 


0.01628 


0.01003 


50 


610.4 


226.8 


0.002757 


0.01157 


0.007717 


0.004633 


100 


1734. 


638.1 


0.0009748 


0.004189 


0.002769 


0.001631 
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